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The ARCH process (R. F. Engle, 1982) constitutes a paradigmatic generator of stochastic time 
series with time-dependent variance like it appears on a wide broad of systems besides economics in 
which ARCH was born. Although the ARCH process captures the so-called "volatility clustering" 
and the asymptotic power-law probability density distribution of the random variable, it is not 
capable to reproduce further statistical properties of many of these time series such as: the strong 
persistence of the instantaneous variance characterised by large values of the Hurst exponent [H > 
0.8), and asymptotic power-law decay of the absolute values self-correlation function. By means of 
considering an effective return obtained from a correlation of past returns that has a g-exponential 
. . . form (exp [x] = [1 + {1 — q) x] i-<j , {q G 5R), and expj [x] = e^) we are able to fix the limitations 

of the original model. Moreover, this improvement can be obtained through the correct choice of a 

I sole additional parameter, q^- The assessment of its validity and usefulness is made by mimicking 

. daily fluctuations of SP500 financial index. 
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I The time evolution analysis of both physical and non-physical observables plays a central role in nowadays scientific 

. research devoted to complexity. Explicitly, studies on time series with geophysical, meteorological, physiological, and 

' financial origin, amid others, have populated scientific literature particularly during the last two decades [l| . Although 

. each type of system has its own (microscopic) dynamical mechanism, the fact is that certain time series obtained 

' from so dispair systems, like those mentioned above, exhibit common statistical features such as asymptotic power-law 
decaying probability density functions, and a long-lasting power-law-like self- correlation function of the magnitude of 

(— I ' the time series observable, notwithstanding a fast vanishing or null self-correlation function of the variable itself. This 

d [ especial class of time series has usually been associated with stochastic processes which have time-dependent variance, 

^ . mathematically defined as heteroskedasticity, in contrast to the other type of time series, said homoskedastic, that 

' present a constant value for the variance. Customarily, the profile of heteroskedastic time series is also reminiscent of 



on-off intermittency [2], i.e., large values of the variable upon analysis are typically followed by other large values, but 
with an arbitrary sign. Within a financial context, this behaviour has been found in price fluctuations of stocks traded 
^ in financial markets or inflation 3]. In 1982, to further mimic the latter, R.F. Engle introduced the autoregressive 
^ conditional heteroskedasticity [ARCH) process fl. This process is considered as a cornerstone of econometrics fact 
£^ \ that awarded Engle the 2003 Nobel Memorial Prize in Economics "for methods of analysing economic time series 
with time- varying volatility" ^. Albeit its attested expressed in its broad application and generalisations 0], Engle's 
ARCH process is unable to properly reproduce the long-lasting behaviour of the volatility self-correlation function, 
I— 'i because it only leads to an exponential decay of this function In the sequel of this manuscript we propose 
a generalisation of the celebrated ARCH (A) process by introducing a memory kernel emerging from current non- 
^ , extensive statistical mechanics formalism \ r\. As a result, we are able to obtain the asymptotic power-law behaviour 
QQ • of the random variable probability density function, which is notably described by g-Gaussian distributions, and 
to mend the shortcoming of Engle's process. The usefulness of this generalisation is shown by modelling daily 
CnJ ■ fluctuations of S'PSOO financial index. 

■ Following Engle [3], we define an autoregressive conditional heteroskedastic {ARCH) time series as a discrete 
' stochastic process, Zt, 

o ; 

. at LOt, (1) 

o ■ 

1^* • where LOt is an independent and identically distributed random variable with null mean and unitary variance, i.e., 
{ujt) = and (wj ) = 1. Henceforth we call Zt as return. Normally, to is associated with a Gaussian distribution (which 
we have used throughout this work), but other distributions for uj have been presented In the seminal paper of 
$-H ■ reference Qj, it has been suggested a possible dynamics for (hereinafter denominated as squared volatility) defining 
it as a linear function of past squared values of zt, 

s 

al^a + Y.^,zU. {a,h>0). (2) 
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^ Volatility is the financial technical term for instantaneous variance. 



For its linear dependence on eq. ([T]), together with eq. have been coined as ARCH (s) linear process. In 
financial practice, namely price fluctuation modelling, the case s — 1 (bi = b) is, by far, the most studied and applied 
of all ARCH-Mke processes. It can be easily verified, even for all s, that, although {zt Zt') ^ Stt', correlation {\zt\ \zt'\) 
is not proportional to 6tt'- As a matter of fact, it has been proved for s = 1 that, {\zt\ \zt'\) decays as an exponential 
law with characteristic time t = |ln6| ^ , which does not reproduce empirical evidences. In addition, it can be verified 
that, the process is stationary with a stationary variance, a, 

(6>1), (3) 



1-6' 



It has also been proved that, even for large s, the exponential decay of {\zt\ \zt>\) remains (check ref. Q for details). 
Furthermore, the introduction of a large value for parameter s gives rise to implementation problems. In other words, 
when s is large, it is very hard to find a set of {bi}, since it represents the evaluation of a large number of fitting 
parameters ^. Despite instantaneous volatility fluctuation, the AB.CH{1) process is actually stationary and it presents 
a stationary returns probability density function with larger kurtosis than the distribution P{i^). The kurtosis excess is 
precisely the outcome of such time-dependence of at- Correspondingly, when 6 = 0, the process reduces to generating 
a signal with the same PDF of oj, but with a standard variation ^Ja. 

We shall now introduce our variation on the ARCH process. Explicitly, we consider a ARCH {1) process where 
an effective immediate past return, Zt-i, is assumed in the evaluation of a^. By this we mean that we have changed 
eq. © by 

(j1 = a + bz1_^, {a,bi>0), (4) 
in which the effective past return is calculated according to 



^ r I',- _ +^ ,2 



ICii-t) zf, (to < t) , (5) 



where 



with 



exp^ [x] = [l + {l-q) x]]_-' , (7) 

Zq^ {t') = Yl^i=-t' s^P^m [*] {[^]+ — max{0, x} ^). For q ~ —oo, we obtain the standard ARCH (1), and for q — 1, we 
have /C {t') with an exponential form since expj^ [x] — e^ [7]. Although it has a non-normalisable kernel, let us refer that 
the value = oo corresponds to the situation in which all past returns have the same weight, K. {t') = l/(t — to + 1). 
The introduction of an exponential kernel has already been made in [l0| but, as stated therein, it is not able to capture 
the long- lasting correlation in at (or \zt\), at least for financial markets ^. Even though, we surmise that some systems 
(apart those we aim to replicate herein) might have a set of its statistical properties well-described by processes for 
which gm < 1- Considering the process as stationary, it is not difficult to verify that eq. ^ holds. 

Moving ahead on the study of our proposal we have performed numerical realisations, based on eq. ([T]) and eq. 
from which we have analysed the return probability density function (PDF), the Hurst exponent of \zt \ integrated 
signal as well as the \zt\ self-correlation function. In order that our goal is to verify the usefulness of eq. © we have 
kept a = ^. Our option is justified by the fact that a might be eliminated if we define a new variable, z' = z/y/a, for 
which standard deviation becomes equal to 1 (when 6 = 0). Besides, expanding eq. ([3]), 



^ A generalisation of eq. (|2)l, = a + bi 2^_j + Ci cr'^_^ [a,bi,Ci > 0), known as GARCH {s,r) process fsj, was introduced in 

1=1 i = l 

order to have a more flexible structure which could correctly mimic data with a simple GARCH (1, 1) process. However, even this 
process presents an exponential decay for {\zt\ \zt'\), with r = |ln (6 + c) , though condition 6 -|- c < 1 guarantees that GARCH (1,1) 
corresponds exactly to an infinite-order ARCH process. 
^ This condition is known in the literature as Tsallis cut off at x = (1 — g)~^ ■ 

^ A worth mentioning continuous time aproach to price dynamics in stock markets using an exponential kernel was presented in ref. Ill| . 



and considering a continuous time approach in eq. ((T]), we might interpret a as the coefficient that is related to the 
magnitude of additive noise, which does not lead to "fat tails" in p{z), whereas b is associated with the strength of 
multiplicative noise which is responsible for the emergence of tails in p (z) ^ [13]. 

To mathematically describe the returns probability density function we have used the q-Gaussian function 



p (z) = Ae, 



(q < 3) 



(8) 



with B = [cr^ (3 — q)] ^, where, 



z^p{z)]Uz/ / [p{z)]Uz 



is the g-generalised second order moment [Tj], and A is the normalisation constant. For q < 5/3, 
the usual variance according with {3 — q) = (5 — 3q) |T5!|. Distribution ((S]) optimises non-additive (or Tsallis) 
entropy, Sq [l6j , and it is widely applied to describe the PDF of returns in stock market indices and other natural and 
artificial processes which present the properties that we aim to reproduce ^ . In the characterisation of p (z), all PDF 
adjustments have only involved one parameter, the index g, since we have normalised z by the standard deviation and 
we have divided p (z) by p (0) — A. Nevertheless, as we shall see further on, the agreement at the peak is clear-cut. 
On account of difficulties ^ about evaluating truthful values of the self-correlation function, 



{x{t) 



(9) 



we have opted to use the integrated \zt\ time series Hurst exponent, H, obtained from the trustworthy DFA method 
which describes the scaling of the root-mean square, F (r), in signals, F (r) - (0 < < 1) « [1_7]. The results of 
q and H obtained from numerical adjustment procedures are depicted in fig. [T] as functions of b and qm- 
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FIG. 1: Left panel: Entropic index q of eq. ([H versus parameter b for several values of memory index qm- All the numerical 
adjustments to obtain q have a (per degree of freedom) error function of the order of 10~^ and squared correlation coefficient 
^ 0.999. Right panel: Hurst exponent H versus parameter b for several values of memory index qm obtained by DFA 
method. The numerical adjustments we have made present a correlation coefficient R ~ 0.999. The runs that lead to the values 
depicted in both panels have 10® elements. 



^ When the distribution for lu is non-Gaussian, b answers for the increase in the tails of p (z). 

® Within a financial context, distribution lO is usually referred to as t-Student distribution which is equivalent to the g-Gaussian 
distribution for g > 1 as it can be easily checked. 

In the evaluation of Ct (x) the stationarity of the signal is assumed, fact that does not necessarily correspond to its actual nature. 
Another problem is the high sensitivity of Ct {x) to the actual average of x {t). 
® For < H < ^ the signal is anti-persistent and composed by anti-correlations, while for ^ < H < 1 the time series is persistent with 
correlations as strong as higher H is. When H = ^ the time series is a Brownian motion (or white noise) analogue. 



As it is visible from fig.[T] (left panel), for constant qm, q increases monotonically as h also increases. For the same h 
we observe that larger values of qm lead to smaller values of q. In other words, by increasing q™, we augment memory 
in (Tj, hence volatility tends to become less fluctuating. As a consequence, p{z) approaches to distribution, since, as 
we have mentioned above, the time dependence of at is the responsible for emergence of the tails in p (z). This effect 
is perfectly observed when q,-a — oo, for which memory efects are so strong (every single element of the past influences 
the present with the same weight), that after some time steps volatility remains constant. 

Concerning Hurst exponent figures, we would like to refer that they have a bearing on the time interval, r', before 
the crossover into H = ^ regime. Whatever the value of qm we have considered, for values of 6 < 0.75, the crossover 
is visible with a transition r, Tc, which increases as b gets larger. Should time series have highly persistent volatility, 
like price fluctuation ones, the crossover is basically unperceptive within a temporal scale up to r = 10® time steps. 

From the set of numerical results we have estimated the best values of q„i and b which can reproduce statistical 
features of a paragon of the type of time series which we have been referring to — the daily fluctuations of SP500 
financial index [3|. Our S'P500 time series runs from the 3'''' January 1950 up to the 28*'' February 2007 in a total of 
14380 business days. The daily return zt is computed as 

zt = In 5'( - In St-i, 

where St represents the 5*^500 value at time t. As it is usual we have didived z by its standard deviation. Gathering 
together the values of q and H for 5'P500, respectively 1.47 ± 0.02 and 0.88, we have verified that qm = 1.6875 
and b = 0.99635 are able to reproduce, with a remarkable agreement, both the return probability density function 
and the Hurst exponent as it is exhibited on fig. [3 and fig. [3) Further, when we have compared, a posteriori, the 
self-correlation functions of eq. (O, we have verified the same qualitative behaviour. In fact, despite both of the 
short range available for fitting and the fluctuations, a quite similar power-law decay with an exponent of 0.73 ± 0.01 
for our model and 0.71 ± 0.02 for 5*^500 as shown on fig. ID Specifically, and according to fig. [H the two curves stand 
basically side by side for r > 20 in a log — log scale. 




FIG. 2: Left panels; Excerpts of SP500 daily normalised return times series with 2000 elements, from the lO"' September 
1997 to the 25"' October 2005, and model (with = 1.6875 and b = 0.99635) for mere illustration proposes. Right panel; 
Probability density function p{z) versus (normalised) z, obtained from the whole time series shown on left panels, in log-linear 
scale (symbols are used for SP500 PDF and line for model PDF). As it can be seen the accordance is quite good. For S'P500 
ftting q — 1.47 ± 0.2 {x^ — 6 x 10~^ and = 0.99). In both cases z is expressed in standard deviation units. 

To summarise, in this manuscript we have introduced a generalisation of Engle's proposal for generating instan- 
taneous volatility in heteroskedastic processes. This modification refers to the introduction of a memory kernel which 
has an asymptotic power-law dependence defined by a parameter qm ■ Apart from the fact that our alteration has been 
able to reobtain the non-Gaussian PDF for the random variable, zt, it has also been successful about reproducing 
the long-lasting (asymptotic power-law decaying) self-correlation of the magnitude of Zt exhibited by a large number 
of phenomena. The improvement in the reproduction of statistical features of such a kind of time series has been 
achieved by considering just one additional parameter, q™, which represents a clear simplification against ARCH (s) 
(with s 3> 1), that only manage to exhibit a exponential volatility self-correlation function with large characteristic 
time, or other heteroskedastic processes f^. By exhaustive numerical analysis of our model we have found a pair 




FIG. 3: Root-mean square fluctuations, F (r), versus r of the time series of flg.E) The values obtained from numerical fitting 
are H = 0.883 ± 0.005 for 5'P500 (squares) and H = 0.886 ± 0.003 for the model (circles). 




FIG. 4: Absolute returns self-correlation function Ct {\zt\) versus t in log-log scale. The similarity on the qualitative behaviour 
of the two curves is evident. Although the fluctuation, evaluating the decay exponents as a result of a fltting procedure for 
large r we have obtained 0.73 ± 0.01 for the model and 0.71 ± 0.03 for SPSOO. 



of values, qm and 6, with which we have mimicked daily fluctuations of SP500. The resemblance between SP500 
time series and the signal obtained by numerical application of our suggestion is remarkably good for the probability 
density function and the Hurst exponent. In a qualitative sense, the correlation function has also been quite well 
described. The quantitative discrepancies verified in Cr (|^t|) and F (t) might be solved if we modify kernel ^ by 
introducing a sort of "characteristic time", T,i.e., in eq. ^ t' t'/T, as another parameter. 

It is well known that there are an infinity of dynamics whose outcome is the same probability density function. 
However, as far as we are able to obtain an appropriate reproduction of further statistical properties, as it is the case 
we have just presented, we will be approaching our models towards the nature of the system upon study. This is 
certainly important when the models are applied, e.g., on forecasting purposes. It is on this basis we support the 
relevance of our propose. 

In respect of financial markets, and considering a macroscopic approach, our model permit us to say that price 
fluctuations are actually dependent on their history, but on a asymptotically scale-free way [31, as it is exhibited 
by the majority of the so-called complex systems. Such a dependence is in contrast with the usual, and analitycally 
simpler, exponential treatment. On a practical way, this also means that past events take long time to loose their 
importance. 

Last of all, owing to C'r i\zt\) asymptotic power-law decay, as it is visible from eq. ([7]), we could make a correspon- 
dence between the decay exponents and a correlation index, Qc. By this we get qc — 2.37 ± 0.03 for our model and 
Qc = 2.41 ± 0.06 for SP500. Such an association introduces an alternative triplet of entropic indices [19], namely 



{qm,q,qc}^ related to non-extensive statistical mechanics formalism that could characterise this type of systems. 
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